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1. Introduction 

This article reviews a particular tool of the spectral theory of orthogonal poly- 
nomials. Let p be a measure on C with finite moments, that is, 

z\ n dfjt{z) < oo (1.1) 

for all n = 0, 1, 2, . . . and which is nontrivial in the sense that it is not supported 
on a finite set of points. Thus, {z n }^ =0 are independent in £ 2 (C, dp:), so by Gram- 
Schmidt, one can define monic orthogonal polynomials, X n (z; dp), and orthonormal 
polynomials, x n — X n /\\X n \\i,2. Thus, 

z j X n (z;dp)dfi(z) = j = 0,...,n-l (1.2) 
X n {z) — z 11 + lower order (1-3) 



x n (z) x m {z) dp = S nm (1.4) 

We will often be interested in the special cases where p is supported on R 
(especially with support compact), in which case we use P n ,p n rather than X n , x n , 
and where p is supported on dH> (D = {z \ \z\ < 1}), in which case we use $„,</?„. 
We call these OPRL and OPUC (for "real line" and "unit circle" ) . 

OPRL and OPUC are spectral theoretic because there are Jacobi parameters 
{a n , b n }^ =1 and Verblunsky coefficients {ot n }%L Q with recursion relations {p~i = 0; 
po = $ = 1): 

zp n (z) = a n+ ip n+ i(z) + b n+1 p n (z) + a n p n -x(z) (1.5) 
<D„ +1 ( Z ) = z$„(z)^„$;( Z ) (1.6) 



$*(z) = z n $ n (l/z) (1.7) 

We will sometimes need the monic OPRL and normalized OPUC recursion 
relations: 

zP n (z) = P n+1 (z) + b n+1 P n (z) + a 2 n P n _ x [z) (1.8) 

zip n (z) = p n ip n+ i(z) + a n tp*(z) (1.9) 

Pn = (1 - |a„| 2 ) 1/2 (1.10) 



Of course, the use of p n implies \a n \ < 1 and all sets of {a„}^ obeying this occur. 
Similarly, b n G R, a n S (0, oo) and all such sets occur. In the OPUC case, {a„}^L 
determine dp, while in the OPRL case, they do if sup(|a„| + \b n \) < oo, and may 
or may not in the unbounded case. For basics of OPRL, see [931 122L 1341 189] ; and 
for basics of OPUC. see [93l 1571 l34l l80l I5H 179] . 

We will use re n (or K n (dii)) for the leading coefficient of x n , p n , or (p n , so 

= \\X n \\-J (d ^ (1.11) 

The Christoffel-Darboux kernel (named after |23L I28| 1 is defined by 



K n {z,Q = >x j (z)x J (0 (1.12) 
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We sometimes use K n (z 7 £; [i) if we need to make the measure explicit. Note that 
if c> 0, 

K n (z,(;cfi) = c^ 1 K n (z,C;^) (1.13) 

since x n (z;cdfi) = c^ 1 ' 2 x n (z; djj). 

By the Schwarz inequality, we have 

\K n (z,()\ 2 <K n (z,z)K n ((,() (1.14) 

There are three variations of convention. Some only sum to n — 1; this is the 
more common convention but (|1.12p is used by Szego [93) . Atkinson [5], and in 
1801 181 j . As we will note shortly, it would be more natural to put the complex 
conjugate on x n (Q, not x n (z) — and a very few authors do that. For OPRL with 
z, £ real, the complex conjugate is irrelevant — and some authors leave it off even 
for complex z and £. 

As a tool in spectral analysis, convergence of OP expansions, and other aspects 
of analysis, the use of the CD kernel has been especially exploited by Freud and 
Nevai, and summarized in Nevai's paper on the subject [68j. A series of recent 
papers by Lubinsky (of which |60|, I61j are most spectacular) has caused heightened 
interest in the subject and motivated me to write this comprehensive review. 

Without realizing they were dealing with OPRL CD kernels, these objects have 
been used extensively in the spectral theory community, especially the diagonal 
kernel 

n 

K n (x,x)=Y,\Pj(x)\ 2 (1-15) 

Continuum analogs of the ratios of this object for the first and second kind poly- 
nomials appeared in the work of Gilbert-Pearson [38j and the discrete analog in 
Khan-Pearson [49] and then in Jitomirskaya-Last [45] . Last-Simon [55j studied 
—K n (x, x) as n — > oo. Variation of parameters played a role in all these works and 
it exploits what is essentially mixed CD kernels (see Section [5]). 

One of our goals here is to emphasize the operator theoretic point of view, 
which is often underemphasized in the OP literature. In particular, in describing 
/i, we think of the operator M z on L 2 (C, d/i) of multiplication by z: 

(M z f)(z) = zf(z) (1.16) 

If supp(d^i) is compact, M z is a bounded operator defined on all of L 2 (C, d/i). If it 
is not compact, there are issues of domain, essential selfadjointness, etc. that will 
not concern us here, except to note that in the OPRL case, they are connected to 
uniqueness of the solution of the moment problem (see |77) ) . With this in mind, 
we use <i(dfi) for the spectrum of M z , that is, the support of <i/x, and <7 ess (<i/i) for 
the essential spectrum. When dealing with OPRL of compact support (where M z 
is bounded selfadjoint) or OPUC (where M z is unitary), we will sometimes use 
^a.c{d^), <r sc (d(j,), er p p(d/z) for the spectral theory components. (We will discuss 
(JessC^A 4 ) om y m the OPUC/OPRL case where it is unambiguous, but for general 
operators, there are multiple definitions; see |31j.) 

The basis of operator theoretic approaches to the study of the CD kernel de- 
pends on its interpretation as the integral kernel of a projection. In L 2 (C, dfj,), the 
set of polynomials of degree at most n is an n + 1-dimensional space. We will use 
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(1.17) 



The order of z and ( is the opposite of the usual for integral kernels and why we 
mentioned that putting complex conjugation on £«(£) might be more natural in 



often called the reproducing property. 

One major theme here is the frequent use of operator theory, for example, 
proving the CD formula as a statement about operator commutators. Another 
theme, motivated by Lubinsky [60 1 161) . is the study of asymptotics of -^K n (x,y) 
on diagonal (x = y) and slightly off diagonal ((x — y) = 0(-^)). 

Sections [2] [3] and [6] discuss very basic formulae, and Sections [4] and [7] simple 
applications. Sections [5] and discuss extensions of the context of CD kernels. 
Section [9] starts a long riff on the use of the Christoffel variational principle which 
runs through Section 1231 Section [24] is a final simple application. 

Vladimir Maz'ya has been an important figure in the spectral analysis of partial 
differential operators. While difference equations are somewhat further from his 
opus, they are related. It is a pleasure to dedicate this article with best wishes on 
his 70th birthday. 

I would like to thank J. Christiansen for producing Figure 1 (in Section [7]) in 
Maple, and C. Berg, F. Gesztesy, L. Golinskii, D. Lubinsky, F. Marcellan, E. Saff, 
and V. Totik for useful discussions. 



We begin with a result that is an aside which we include because it deserves to 
be better known. It was rediscovered and popularized by Berg [11] . who found it 
earliest in a 1939 paper of Collar [24], who attributes it to his teacher, Aitken — so 
we dub it the ABC theorem. Given that it is essentially a result about Gram- 
Schmidt, as we shall see, it is likely it really goes back to the nineteenth century. 
For applications of this theorem, see |13L 147] . 

K n is a polynomial of degree n in z and so we can define an (n+1) x (n + 1) 
square matrix, k^ n \ with entries fc^™], < j, m < n, by 




(1.18) 




(1.19) 



2. The ABC Theorem 



n 




(2.1) 



j,m— 



One also has the moment matrix 




(2.2) 
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< j,k < n. For OPRL, this is a function of j + k, so m^ n ' is a Hankel matrix. 
For OPUC, this is a function of j — k, so m' n ) is a Toeplitz matrix. 

Theorem 2.1 (ABC Theorem). 

(m^)- 1 = fcW (2.3) 

Proof. By (flTTST) for £ = 0, . . . , n, 

K n (z,C)z e dn(z) = C e (2.4) 
Plugging (|2.1p in for iT, using (|2.2p to do the integrals leads to 

n 

E = (2-5) 

j,q=o 

which says that 

which is HO. □ 



Here is a second way to see this result in a more general context: Write 

3 

Xj(z) = ^2a jk z k (2.7) 

fc=0 

so we can define an (n + 1) x (n + 1) triangular matrix a( n ) by 

aj } = a jfc (2.8) 
Then (the Cholesky factorization of k) 

fcW= a ( n )(oW)* (2.9) 
with * Hermitean adjoint. The condition 

(xj,x e ) = 6jt (2-10) 

says that 

(a(™))*m( n V n) ) = 1 (2-11) 
the identity matrix. Multiplying by (a^)* on the right and [(a^)*] -1 on the left 
yields (12. 3| . This has a clear extension to a general Gram-Schmidt setting. 

3. The Christoffel Darboux Formula 

The Christoffef-Darboux formula for OPRL says that 



r- , ,' Pn+l(z) Pn{Q ~ Pn(z) p n+ l(C) \ 

K„(2, C) = On+1 ( J— ^ ) (3.1) 



and for OPUC that 



1 - z( 

The conventional wisdom is that there is no CD formula for general OPs, but we 
will see, in a sense, that is only half true. The usual proofs are inductive. Our 
proofs here will be direct operator theoretic calculations. 
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We focus first on (13. 1| . From the operator point of view, the key is to note 
that, by ([TTTTj) . 



(g,[M z ,ir n ]f) = J g((;)(C-z)K n (z,Of(z)dn(C)dn(z) (3.3) 

where [A,B] = AB - BA. For OPRL, in (pD?]) . C and z are real, so dHJ for 
z,( £ <r(dfi) is equivalent to 

[M Z! 7T„] = a„+l[(p„, ')Pn+l - (Pn+1, - )Pn] (3.4) 

While (|3.4|) only proves (|3.ip for such z, £ by the fact that both sides are polynomials 
in z and £, it is actually equivalent. Here is the general result: 

Theorem 3.1 (General Half CD Formula). Let /j be a measure on C with finite 
moments. Then: 

(l-7r n )[M,,7r tt ](l-7r n ) = (3.5) 

7T„[M z ,7T„]7r„ = (3.6) 

(1 - iT n )[M z ,Tv n ]w n = -^4- (x n , ■ )x n+x (3.7) 

Remark. If \x has compact support, these are formulae involving bounded 
operators on L 2 (C,d[i). If not, regard ir n and M z as maps of polynomials to 
polynomials. 

Proof. (|3.5p follows from expanding [M z ,ir n ] and using 

7T„(1 - 7T„) = (1 - 7T n )7T n = (3.8) 

If we note that [M z ,7r„] = — [M z , (1 — 7r n )], (|3 . 6[) similarly follows from (|3 . 8[) . By 
(|3.8|) again, 

(1 - 7r n )[M z ,7r„]7r„ = (1 - 7r„)M z 7r„ (3.9) 

On ran(7r„_i), 7r„ is the identity, and multiplication by z leaves one in 7r„, that is, 

(1 - n n )M z n„ r ran( 7 r n _ 1 ) = (3.10) 

On the other hand, for the monic OPs, 

(1 - 7r n )M z ir n X n = X n+1 (3.11) 

since M z ir n X n = z n+1 + lower order and (1 — 7r„) takes any such polynomial to 
X n+ \. Since 

ll-^rc+lll / v \ v 



ll^n., 

we see (|3.4|) holds on ran(l — ir n ) + ran(7r n _i) + [X n ], and so on all of L 2 . □ 

From this point of view, we can understand what is missing for a CD formula 
for general OP. The missing piece is 

n n [M z , 7T„](1 - 7T n ) = ((1 - 7T n )M> n )* (3.12) 

The operator on the left of (|3.7p is proven to be rank one, but (1 — l K n )M*-K n is, in 
general, rank n. For ip £ ker[(l — 7r n )Af*7r rl ] nran(7r n ) means that ip is a polynomial 
of degree n and so is zip, at least for a.e. z with respect to /x. Two cases where many 
zip are polynomials of degree n — indeed, so many that (1 — ■n n )M*ir n is also rank 
one — are for OPRL where zip = zip (a.e. z G <r(d/i)) and OPUC where zip = 
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(a.e. z 6 a{dji)). In the first case, zip € ran(7r„) if deg(^) < n— 1, and in the second 
case, if ip(0) = 0. 

Thus, only for these two cases do we expect a simple formula for [M Zl tt]. 
Theorem 3.2 (CD Formula for OPRL). For OPRL, we have 

[M z ,TT n ] = a n +l[(p n , - )Pn+l - (Pn+1, ')Pn] (3.13) 

and (|3.ip holds for z ^ C- 

Proof. Inductively, one has that p n (x) — (ai . . . a n ) x n + . . . , so 

||P„|| =a 1 ...a„ / i(M) 1 / 2 (3.14) 

and thus, 

"p^r - an+1 (3 5) 

Moreover, since M* = M z for OPRL and [A, B]* = -[A*,B*], we get from 
(gUgl that 

TT n [M z , 7r n ](l -7T„) = -a n+ i(p„+i, -)p„ (3.16) 
(|33 ]1 -(|3T |1 . (|3T4| . and (f3TT6]l imply (|3~13| which, as noted, implies j3T]). □ 



For OPUC, the natural object is (note M Z M* = M*M Z = 1) 

B n = 7r„ - M z ir n M* z = -[M z ,n n ]M* z (3.17) 
Theorem 3.3 (CD Formula for OPUC). For OPUC, we have 

TT n - M z 1T n M* = (f* n+1 , -)Vn+l - (fn+1, ■ )<Pn+l (3.18) 

and (|3.2[) holds. 

Proof. i?„ is selfadjoint so ran(i? n ) = ker(.B n ) . Clearly, ran(£?„) C 
ran(7r„) + M z [ran(7r„)] = ran(7r„ +1 ) and B n z l — for I — 1, . . . , n, so ran(B n ) = 
{z,z 2 , . . . ,2"}^ n ran(7r, i+ i) is spanned by (p n +i and p* l+ i- Thus, both B n and 
the right side of (|3 . 18[) are rank two selfadjoint operators with the same range and 
both have trace 0. Thus, it suffices to find a single vector r\ in the span of ip n +\ and 
fn+i w hh B n rj = (RHS of (|3.18[) )t7. since a rank at most one selfadjoint operator 
with zero trace is zero! 

We will take rj = z<p n , which lies in the span since, by (|1.9|) and its * , 

PnPn+l = Zlfin ~ OL n ip* n PnVn+l =( Pn~ a riZip n (3.19) 

By (l3~T6l) . (j3T7l) . and 

\\$n\\=Po---Pn-iridV>) (3-20) 

we have that 

B n (zip n ) = [w n ,M z ]ip n 

= -(1 - n n )M z TT n ip n 

= -PnVn+l (3-21) 

On the other hand, ip^+i J- {z, ■ ■ • , z™ +1 }, so 

{fn+l,Z<Pn) = 

and, by ([3TT^ . 
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= Pn 

SO 

[LHS of il|]z^ = -Pn^n+l (3.22) 

□ 

Note that (pH)) implies 

K n+ i(z, C) = K n (z, C) + v? n +i(z) y„+i(C) 

= <+l ( z ) In+l (0 ~ (z) <gn+l (C) 

so changing index, we get the "other form" of the CD formula for OPUC, 

Kn{ = M^y;(C)-^)Cyn(c) (323) 

1 — Zip 

We also note that Szego }93j derived the recursion relation from the CD formula, 
so the lack of a CD formula for general OPs explains the lack of a recursion relation 
in general. 

4. Zeros of OPRL: Basics Via CD 

In this section, we will use the CD formula to derive the basic facts about the 
zeros of OPRL. In the vast literature on OPRL, we suspect this is known but we 
don't know where. We were motivated to look for this by a paper of Wong [105j . 
who derived the basics for zeros of POPUC (paraorthogonal polynomials on the 
unit circle) using the CD formula (for other approaches to zeros of POPUCs, see 
[191 186j ). We begin with the CD formula on diagonal: 

Theorem 4.1. For OPRL and x real, 

n 

^2 M X )| 2 = a n+l[Pn+l(x)Pn(x) - p' n (x)p n+1 (x)} (4.1) 
3=0 

Proof. In (|3.1|) with z — x, ( — y both real, subtract p n+ i(y)p n (y) from both 
products on the left and take the limit as y — > x. □ 

Corollary 4.2. Ifp n (x ) = for x real, then 

p n +i{x )p' n (x ) < (4.2) 

Proof. The left-hand side of (|4.1|) is strictly positive since po(x) = 1. □ 

Theorem 4.3. All the zeros ofp n (x) are real and simple and the zeros of p n +i 
strictly interlace those of p n . That is, between any successive zeros of p n +i lies 
exactly one zero of p n and it is strictly between, and p n +\ has one zero between 
each successive zero of p n and it has one zero above the top zero of p n and one 
below the bottom zero of p n . 

Proof. By (|4.2p . p n (x Q ) = p' n {xo) ^ 0, so zeros are simple, which then 
implies that the sign of p' n changes between its successive zeros. By (|4.2I) . the sign 
of Pn+i thus changes between zeros of p n , so p n +i has an odd number of zeros 
between zeros of p n . 
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pi is a real polynomial, so it has one real zero. For x large, p n (x) > since 
the leading coefficient is positive. Thus, p' n (xo) > at the top zero. From (14.21) . 
Pn+i{xo) < and thus, since p n +\{x) > for x large, p n +i has a zero above the 
top zero of p n . Similarly, it has a zero below the bottom zero. 

We thus see inductively, starting with p±, that p n has n real zeros and they 
interlace those of p n -i- D 

We note that Ambroladze [3] and then Denisov-Simon [29] used properties of 
the CD kernel to prove results about zeros (see Wong [105j for the OPUC analog) ; 
the latter paper includes: 

Theorem 4.4. Suppose = sup„ a n < oo and x € K has d — 

dist(xo, <j(dn)) > 0. Let S = d 2 /(d+ v2<2oo)- Then at least one of p n and p n -i has 
no zeros in (xq — 5, xq + S). 

They also have results about zeros near isolated points of <j(dfi). 



5. The CD Kernel and Formula for MOPs 

Given an I x I matrix-valued measure, there is a rich structure of matrix OPs 
(MOPRL and MOPUC). A huge literature is surveyed and extended in [27] • In 
particular, the CD kernel and CD formula for MORL are discussed in Sections 2.6 
and 2.7, and for MOPUC in Section 3.4. 

There are two "inner products," maps from L 2 matrix-valued functions to ma- 
trices, (( • , • ))r and (( • , • The R for right comes from the form of scalar homo- 
geneity, for example, 

((f, 9 A)) K = ((f,g)) n A (5.1) 

but ({f,Ag)) R is not related to ((/,g)) R . 

There are two normalized OPs, pf (x) and pj'(x), orthonormal in (( ■ , ■ )}r and 
(( • , • respectively, but a single CD kernel (for z, w real and ^ is matrix adjoint), 

n 

K n (z,w) = J2Pk(z)PkM i (5-2) 

n 

= 5>£ ( zM M (5.3) 

fc=0 

One has that 

<(^(-,z),/(-)))r = K/)(z) (5.4) 

where 7r„ is the projection in the Tr((( • , • ))r) inner product to polynomials of 
degree n. 

In [27] . the CD formula is proven using Wronskian calculations. We note here 
that the commutator proof we give in Section [3] extends to this matrix case. 

Within the Toeplitz matrix literature community, a result equivalent to the CD 
formula is called the Gohberg-Semencul formula; see [Hfl [Ml [Ml Hffl HS [TOO] 
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6. Gaussian Quadrature 



Orthogonal polynomials allow one to approximate integrals over a measure d\x 
on M by certain discrete measures. The weights in these discrete measures depend 
on K n (x,x). Here we present an operator theoretic way of understanding this. 

Fix n and, for feel, let J n -,F(b) be the n x n matrix 



(6.1) 



(i.e., we truncate the infinite Jacobi matrix and change only the corner matrix 
element b n to b n + b). 

Let Xj(b), j — 1, . . . , n, be the eigenvalues of J n -F{b) labelled by x\ < X2 < 
. . . . (We shall shortly see these eigenvalues are all simple.) Let (pj 1 be the nor- 







«i 





\ 




ai 


h 


«2 




Jn;F{b) = 





«2 


b 3 






V 






bn + b) 



malized eigenvectors with components [ipj (b)]e, 1= 1, . . . , n, and define 

\f\b) = \\^f\b)\ x f 
so that if e± is the vector (10... 0)* , then 

n 



(6.2) 



i=i 



x) n > (6) 

is the spectral measure for J n -F(b) and e±, that is, 

n 

(e 1 ,J n . tF (b) i e 1 ) = J2^ n) (b)^ n) (bY 

3 = 1 

for all £. We are going to begin by proving an intermediate quadrature formula 



(6.3) 



(6.4) 



Theorem 6.1. Let /i be a probability measure. For any b and any £ — 
0,l,...,2n-2, 



/n 



(6.5) 



7/6 = 0, this holds also for £ = 2n — 1. 



PROOF. For any measure, {a>j,bj}^_^ determine {pj}™-Q, and moreover, 

2 



x|p n _i(x)| d^ = b n 



(6.6) 



If a measure has finite support with at least n points, one can still define {pj}™ = Q , 
Jacobi parameters {a,-, &j}™=i ( and b n by (|6.6p . 

dfi and the measure, call it d/ij , of (|6.3p have the same Jacobi parameters 
{aj, 6j}?=i , so the same {pj}™Zoi an d thus by 



x k pj(x) d/i — fe = 0, 1, . . . , j — 1; j = 1, . . . ,n — 1 



(6.7) 
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we inductively get (16. 5|) for I — 0, 1, 2, . . . , 2n — 3. Moreover, 

p n ^(x) 2 d^=l (6.8) 



determines inductively (|6.5p for £ = 2n — 2. Finally, if b = 0, (|6.6p yields (|6.5|) for 
£ = 2n-l. □ 

As the second step, we want to determine the x^ n \b) and X^ n \b). 

Theorem 6.2. Let K n . F = ix n -\M z ii n -\ \ ran(7r„_i) for a general finite mo- 
ment measure, fi, on C. Then 

det ran ( Wn _ 1 )(zl - K n . F ) = X„(z) (6.9) 

Proof. Suppose X n (z) has a zero of order £ at zq. Let ^ = X n (z)/(z — zqY . 
Then, in ran(-7r„), 

(K n . F -z ) j ip^0 j=0,l,...,£-l (6.10) 
(K n , F - z )V = (6.11) 

since (M z — zqY<p = X n (z) and ir n ^iX n = 0. Thus, zq is an eigenvalue of K n - F of 
algebraic multiplicity at least I. Since X n (z) has n zeros counting multiplicity, this 
accounts for all the roots, so (|6.9[) holds because both sides of monic polynomials 
of degree n with the same roots. □ 

Corollary 6.3. We have for OPRL 

det(z - J n , F {b)) = P n {z) - bPn-x(z) (6.12) 

The eigenvalues Xj- n '(6) are all simple and obey for < b < oo and j = l,...,n 
(with x ra +i(0) = oo), 

4 n) (0) <x { ;\b) <z£\(0) (6.13) 
and for — oo < 6 < and j = 1, . . . , n (with i„_i(0) = — oo), 

(0) <if\b) <4 n) (0) (6.14) 

Proof. (I6.12p for b = is just (|6.9p . Expanding in minors shows the determi- 
nant of (z — J n - F (b)) is just the value at b = minus b times the (n — 1) x (n — 1) 
determinant, proving (|6.12p in general. 

The inequalities in (|6.13|) / (|6. 14(1 follow either by eigenvalue perturbation theory 
or by using the arguments in Section [4j □ 

In fact, our analysis below proves that for < b < oo, 

4 n) (0) <4 n) (6) <ij n - 1) (0) (6.15) 

The recursion formula for monic OPs proves that Pj(xj(b)) is the unnormalized 
eigenvector for J n;F (b). K n —x(xj (6), XjQ))) 1 ' 2 is the normalization constant, so 
since po = 1 (if /j(R) = 1): 

Proposition 6.4. // ^(R) = 1, f/ien 

A^(6) = (ir n _ 1 (^(" ) (&),M n) (6)))- 1 (6.16) 
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Now fix n and xq € R. Define 

b(x ) = ~^~~~T~T (6-17) 
with the convention b — oo if P„_i(a;o) = 0. Define for b ^= oo, 

4 n) (^)=4" ) (K^o)) j=l,...,n (6.18) 

and if b(x<y) = oo, 

^(zq) = xf- 1] {Q) j = 1, . . .,n- 1 (6.19) 

and 

A^Cro) = (^-i^K),^^)))" 1 (6.20) 
Then Theorem 16.11 becomes 



Theorem 6.5 (Gaussian Quadrature). Fixn,xo. Then 

„ n 

j Q(x)d(i = ^2^\^)Q(x { ;\x Q )) (6.21) 

3=1 

/or polynomials Q of degree up to: 

(1) 2ra- 1 ifP n (x Q ) = 

(2) 2n-2 ^^/O/^M 

(3) 2n-3 tfP„_i(s ) =0. 

Remarks. 1. The sum goes to n — 1 if P„_i(x ) = 0. 

2. We can define to be the solutions of 

p n -l(x )p n (x) - p n (x )Pn-l(x) = (6.22) 

which has degree n if p n ^i(xo) 7^ and n — 1 if p n _i(xo) = 0. 

3. (|6.20p makes sense even if /x(R) 7^ 1 and dividing by /x(R) changes / <3(x) d// 
and a!| by the same amount, so (|6 . 2 1[) holds for all positive /1 (with finite mo- 
ments), not just the normalized ones. 

4. The weights, A^" (xo), in Gaussian quadrature are called Cotes numbers. 

7. Markov Stieltjes Inequalities 

The ideas of this section go back to Markov |63j and Stieltjes [92] based on 
conjectures of Chebyshev [21] (see Freud [34] ) . 

Lemma 7.1. Fix xx < • • • < x n in R distinct and 1 < I < n. Then there is a 
polynomial, Q, of degree 2n — 2 so that 

(i) 

Qto) = \l \ !'.","' ^ 

10 1 — t + 1, . . . ,n 

(ii) For all x E R, 

Q(x) > X(-oo,x t ](x) (7.2) 
Remark. Figure 1 has a graph of Q and X(-oo,x e ] for n = 5, 1 = 3, Xj = j — 1. 
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Figure 1. An interpolation polynomial 

Proof. By standard interpolation theory, there exists a unique polynomial of 
degree k with k + 1 conditions of the form 

Q(y j ) = Q'(yj) = --- = Q {n * ) (y j ) = o 

^2 j rij = k + 1. Let Q be the polynomial of degree 2n — 2 with the n conditions in 
(|7.ip and the n — 1 conditions 

Q / (x i ) = j = l,...,£-l,£+l,...,n (7.3) 

Clearly, Q' has at most 2n— 3 zeros, n — 1 are given by (|7.3[) and, by Snell's the- 
orem, each of the n — 2 intervals [x\, X2), ■ ■ ■ , (2^-1, Xt), (2^+1, 2^+2), ■ ■ ■ , (x«.-i> x n ) 
must have a zero. Since Q' is nonvanishing on (xi,xg+i) and Q(xe) = 1 > 
Q(xe+i) = 0, Q'(y) < on (ai^, 2^+1). Tracking where Q' changes sign, one sees 
that (O) holds. □ 



Theorem 7.2. Suppose d/i is a measure on K with finite moments. Then 

t^s-^ — ^ — > oo,x ]) 

r., (»), ^ 1 K n -x(x) >(x ),x) '(x )) 

> ^((— 00, a; )) > 7-t — p> — 

{il^^Xxo}^"- 1 ^- (^0)^} (*<>)) 



(7.4) 



Remarks. 1. The two bounds differ by K n -i(x a , xq)^ 1 . 
2. These imply 

K{ x o}) < Kn-^xo^xoy 1 (7.5) 
In fact, one knows (see (|9.21j) below) 

K{ x o}) = hm K n -x{x iXo)~ X (7.6) 

n—*oo 

If {i({xq}) = 0, then the bounds are exact as n — > oo. 
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PROOF. Suppose P n ^i(xo) ^ 0. Let t be such that x^ 1 \xq) — Xq. Let Q be 
the polynomial of Lemma 17.11 By (|7.2p , 

fi((-oo,x ]) < / Q(x)dn 



and, by (|7.1[) and Theorem 16.51 the integral is the sum on the left of (|7.4p . 
Clearly, this implies 

^i((xo,oo)) > 7— r t^t 

r., (»)/ w x K n -i(x y>(xv),x ,-(£<))) 

which, by x — > — x symmetry, implies the last inequality in (|7.4p . □ 

Corollary 7.3. If I < k - 1, then 
fc-i 



j=t+l K{X) >{X ),X) >{x Q )) 



' 1 



(7.7) 



^K(x^(x ),x^(x )) 



PROOF. Note if x\ = x^ 1 '(xq) for some i, then Xj(xo) — x^\x\), so we get 



(|7.7[) by subtracting values of (|7.4p . □ 

Notice that this corollary gives effective lower bounds only if fc— 1 > 1+1, that 
is, only on at least three consecutive zeros. The following theorem of Last-Simon 
|57j . based on ideas of Golinskii [41] . can be used on successive zeros (see |57j for 
the proof). 

Theorem 7.4. If E,E' are distinct zeros of P n (x), E = |(E + E') and S > 
\\E-E'\, then 



E-E' > 



5*-$\E-E>\*f 



3n 



K n (E,E) 



1/2 



(7- 



8. Mixed CD Kernels 



Recall that given a measure fi on R with finite moments and Jacobi parameters 
{a n , 6 n }5^Li, the second kind polynomials are defined by the recursion relations (|1.5p 
but with initial conditions 

q Q {x) = q 1 (x)=ai 1 (8.1) 

so q n (x) is a polynomial of degree n — 1. In fact, if fi is the measure with Jacobi 
parameters given by 

a>n — a n+i b n = b n+ i 

then 

q n (x; dfi) = aj" 1 p n _ 1 (x; d/i) (8.2) 
It is sometimes useful to consider 

n 

K^{x,y)=Y J W)l 3 (v) (8-3) 
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and the mixed CD kernel 

n 

K^(x,y) = Y,v(xjpM (8-4) 

3=0 

Since (|8.2|) implies 

K^{x,y;dn) = a^ 2 K n _ x (x,y;dfl) (8.5) 

there is a CD formula for which follows immediately from the one for K. 

There is also a mixed CD formula for Kn q ^ ■ 

OPUC also have second kind polynomials, mixed CD kernels, and mixed CD 
formulae. These are discussed in Section 3.2 of [80] , 

Mixed CD kernels will enter in Section [5T] 

9. Variational Principle: Basics 

If one thing marks the OP approach to the CD kernel that has been missing 
from the spectral theorists' approach, it is a remarkable variational principle for 
the diagonal kernel. We begin with: 



Lemma 9.1. Fix (an, ... , a m ) e C m . Then 



min(5>| 2 X>^ = 1 ) = (El a il a ) (9.1) 

■3=1 3=1 ' S'=l 



-1 



with the minimizer given uniquely by 

jo) _ 



El n 
3=1 



a 



2 



(9.2) 



Remark. One can use Lagrange multipliers to a priori compute z^ and prove 



this result 
Proof. If 

5>^ = 1 (9.3) 

3 = 1 

then 

m m / m \ — 1 

E^fi^EM 2 - En 2 ( 9 - 4 ) 
j=i j=i v j=i 1 

from which the result is obvious. □ 
If Q has deg(Q) < n and Q n (z ) — 1, then 

n 

Qn{z) = J2 a 3 X o( Z ) ( 9 ' 5 ) 

3=0 

with Xj the orthonormal polynomials for a measure d/i, then ^2 ctjXj(zo) = 1 and 
IIQwlH^CdM) = Z)"=ol a jl 2 - Tllus tnc lemma implies: 

Theorem 9.2 (Christoffel Variational Principle). Let ^ be a measure on C with 
finite moments. Then for zq £ C, 

,(js ) = 1, deg(Q„) < n ) = —± (9.6) 

J K n {z ,z ) 



mm(^J\Q n (z)\ 2 dfj, 
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and the minimizer is given by 

One immediate useful consequence is: 
Theorem 9.3. If n<v, then 

K n (z,z;dv) < K n (z,z;d(i) (9.8) 
For this reason, it is useful to have comparison models: 

Example 9.4. Let d[i = dd/2n for z = re l8 and ( = e lip . We have, since 

tp n (z) = z n , 

i _ r n+l p i(n+l)(tp—0) 

If r < 1, K n (z, zo) has a limit as n — > oo, and for z = e J(p , z = re 10 , r < 1, 

|Q„(z,z )| 2 g^P r (0,^) J (9.10) 

the Poisson kernel, 
For r = 1, we have 

i*.<^)i' = ^'f% ( '7" <" 2 ' 

sin (0 — </?) 

the Fejer kernel. 

For r > 1, we use 



which implies, for z = e lv , zq = re , r > 1 



\Q n (z,z )\ 2 ^ ^P^e^)^ (9.14) 

□ 

Example 9.5. Let d/io be the measure 

d/J, (x) = V4^X[- 2 ,2]W dx (9.15) 

on [—2, 2]. Then p n are the Chebyshev polynomials of the second kind, 

sin(n + 1)0 „n 

Pn(2cosg) = \ , ; (9.16) 
sin 

In particular, if \x\ < 2 — (5, 

\p n {x + iy)\<C h6 e nC ^\ (9.17) 

and so 

±|tf„(a; + iy,a; + ty)| <C\ b e 2nC ^ (9.18) 
n 

□ 

The following shows the power of the variational principle: 
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Theorem 9.6. Let 

dfj, = w(x) dx + d/j, s (9.19) 
Suppose for some xq, 5, we have 

w(x) > c > 

for x £ [xq — 5, Xq + 5]. Then for any 5' < 5 and all x £ [xq — 5' , xq + 5'], we have 
for all a real, 

l K Jx + -,x + -\<C 1 e c ^ (9.20) 



Proof. We can find a scaled and translated version of the dfio of l|9.15p with 
M > Mo- Now use Theorem [931 and (|9TT5|) . □ 

The following has many proofs, but it is nice to have a variational one: 

Theorem 9.7. Let [i be a measure on R of compact support. For all xq £ R, 

lim K n (x ,x Q ) ^^({xo})- 1 (9.21) 

n — >oo 

Remark. If h({xq}) = 0, the limit is infinite. 

Proof. Clearly, if Q(x ) = 1, J\Q n (x)\ 2 d/j, > n({x }), so 

K n {x 0l x Q ) < ^{{xq})- 1 (9.22) 
On the other hand, pick A > diam(<r(d/i)) and let 

Q 2n (x) = (l-^^-J (9.23) 

For any a, 

sup \Q 2n (x)\ = M 2n (a) ^ (9.24) 

\x — Xq I >a 

so, since Q 2n < 1 on a(d[i), 

K n {x ,x ) > H(x -a,x + a)) + M 2n (a)]- 1 (9.25) 

so 

liminf K n (x ,x ) > \p((x - a,x Q + a))] (9.26) 

for each a. Since lim a j. /i((^o — a,x Q + a)) = ^({x }), (|9.22[) and (|9.26|) imply 

dOTH . □ 

10. The Nevai Class: An Aside 

In his monograph, Nevai [67) emphasized the extensive theory that can be 
developed for OPRL measures whose Jacobi parameters obey 

a n — > a b n — > b (10-1) 

for some b real and a > 0. He proved such measures have ratio asymptotics, that is, 
P n+ i(z) / P n (z) has a limit for all z £ C\R, and Simon [78j proved a converse: Ratio 
asymptotics at one point of C+ implies there are a, b, with ([10. ip . The essential 
spectrum for such a measure is [b ~ 2a ,b + 2a], so the Nevai class is naturally 
associated with a single interval e C R. 

The question of what is the proper analog of the Nevai class for a set e of the 
form 

e = [ai,0i] U [a 2 , ft] U . . . [at +1 ,0 t+1 ] (10.2) 



18 



B. SIMON 



with 

a x < fa < ■ ■ ■ < a l+1 < (3 l+1 (10.3) 

has been answered recently and is relevant below. 

The key was the realization of Lopez [H [9] that the proper analog of an arc 
of a circle was \a n \ —> a and a n +\a n — * a 2 for some a > 0. This is not that a n 
approaches a fixed sequence but rather that for each k, 

min V la,- - ae i9 \ -> (10.4) 

as — ► oo . Thus, ay approaches a set of Verblunsky coefficients rather than a fixed 
one. 

For any finite gap set e of the form (I10.2[) / (|10.3p . there is a natural torus, 
J7e, of almost periodic Jacobi matrics with a ess (J) = e for all J £ J t . This can be 
described in terms of minimal Hcrglotz functions [901 189] or reflectionless two-sided 
Jacobi matrices |75j . All J € J t are periodic if and only if each [atj, f3j] has rational 
harmonic measure. In this case, we say e is periodic. 

Definition. 



oo 



d m ({an,b n }%L l7 {a„,6„}£° =1 ) = e °{\a m + 3 ~ a m+j \ + \b m+j - b m+j \) (10.5) 

3=0 

d m {{a n ,b n },J t ) = min d m ({a n ,b n }, J) (10.6) 

Definition. The Nevai class for e, N(e), is the set of all Jacobi matrices, J, 
with 

d m (J,Je)^0 (10.7) 

as tyi — ► oo. 

This definition is implicit in Simon [81] : the metric d m is from [26] . Notice 
that in case of a single gap e in <9B, the isospectral torus is the set of {a n }^L with 
a n = ae lS for all n where a is e dependent and fixed and 9 is arbitrary. The above 
definition is the Lopez class. 

That this is the "right" definition is seen by the following pair of theorems: 

Theorem 10.1 (Last-Simon [56]). If J e N(e), then 

<J CSS (J) = e (10.8) 
Theorem 10.2 ([26] for periodic e's; [75] in general). If 

v css (J) = o- ac (J) = e 

then J £ N(t). 

11. Delta Function Limits of Trial Polynomials 

Intuitively, the minimizer, Q n (x, xq), in the Christoffel variational principle 
must be 1 at z and should try to be small on the rest of a(d/i). As the degree gets 
larger and larger, one expects it can do this better and better. So one might guess 
that for every S > 0, 

sup \Q n (x,x )\ -> (11-1) 

\x — x \>S 
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While this happens in many cases, it is too much to hope for. If x\ G er(d/i) but 
fj, has very small weight near x\, then it may be a better strategy for Q n not to 
be small very near x\ . Indeed, we will see (Example II 1 .3[) that the sup in 
can go to infinity. What is more likely is to expect that \Q n (x, xo)\ 2 d/i will be 
concentrated near xq. We normalize this to define 

J \Qn{x,x )\ 2 dfj,(x) 

so, by l|9^ ]) /fgT p . in the OPRL case, 

(11.3) 

J\ n yX : Xq ) 

We say fi obeys the Nevai (^-convergence criterion if and only if, in the sense of 
weak (aka vague) convergence of measures, 

d V ^'\x)^S X0 (11.4) 

the point mass at xq. In this section, we will explore when this holds. 

Clearly, if x $ c(dju), (|1 1 .4[) cannot hold. We saw, for OPUC with d[i = d9 /2tt 
and z ^ 9B, the limit was a Poisson measure, and similar results should hold for 
suitable OPRL. But we will see below (Example lll.2[) that even on a(d/i), (| 1 1 .4f) 
can fail. The major result below is that for Nevai class on e lnt , it does hold. We 
begin with an equivalent criterion: 

Definition. We say Nevai's lemma holds if 

lim KMP (U5) 

»^oo K n (XQ,X ) 

Theorem 11.1. If dfi is a measure on K with bounded support and 

inf a n > (11.6) 

n 

then for any fixed xq € R, 

{TO]) (fTO]) 

Remark. That (|11.5p =>■ (|1 1 .4[) is in Nevai [57]. The equivalence is a result of 
Breuer-Last-Simon 



Proof. Since 



K n -i(x ,X ) \Pn(xo)\' 



K n (x ,x ) K n (x ,x ) 



(11.7) 



K n (x a ,x ) 



so 

cud 

We thus conclude 



|p ra+ i(a:o)| 2 _ \p n+ i{x )\ 2 K n+1 (x 0l x ) _^ 
K n (x ,x ) K n+1 (x ,x ) K n (x ,x ) 

ra ^W + k±M^ fl (119) 

K n (x 0l XQ) 

By the CD formula and orthonormality of Pj(x), 

\x- x \ 2 \K n (x,x )\ 2 dfi = a 2 n+1 \p n (x ) 2 +p n +i{x ) 2 } (11.10) 
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so, by HO]) and (fTTTTOj) . 

y - x \ 2 d^°\x) -> o & dnm 

when On is uniformly bounded above and away from zero. But since <i77„ have 
support in a fixed interval, 

dl ^ J\x-x Q \ 2 d^ -> □ 

Example 11.2. Suppose at some point xo, we have 

lim (b„(x )| 2 + K+^xq)! 2 ) 1 /" -> A > 1 (11.11) 

n — >oo 

We claim that 

,. \Pn(xo)\ 2 h1 .„> 

hmsup — — r > (11.12) 

n-too K n {Xo,Xo) 

for if (| 11 . 1 2[) fails, then (111.5P holds and, by (|11.7D . for any e, we can find -/V so 
for n > N , 

K n+1 (x ,x ) < (l + E)K n (x ,x ) (11.13) 

so 

lim K n (x a ,x Q ) 1/n < 1 

So, by PT3|) , (fTTTTTj) fails. Thus, (fTTTTTJl implies that (fTT3| fails, and so (fTO)) 

fails. □ 

Remark. As the proof shows, rather than a limit in (|11.12p . we can have a 
liminf > 1. 



The first example of this type was found by Szwarc [94| . He has a d/i with 
pure points at 2 — n^ 1 but not at 2, and so that the Lyapunov exponent at 2 was 
positive but 2 was not an eigenvalue, so (jll.lip holds. The Anderson model (see 
[20]) provides a more dramatic example. The spectrum is an interval [a, b] and 
(jll.lip holds for a.e. x G [a, b]. The spectral measure in this case is supported at 
eigenvalues and at eigenvalues (|11.8p . and so (|11.4p holds. Thus (111 .4[) holds on a 
dense set in [a,b] but fails for Lebesgue a.e. xo! 

Example 11.3. A Jacobi weight has the form 

dn(x) = Ca, b {l- x) a {l + x) b dx (11-14) 

with a, b > —1. In general, one can show [93j 

Pn (l) ~ cn a+1 / 2 (11.15) 

so if x e (—1, 1) where |p„(a;o)| 2 + \p 7 i-i{xo)\ 2 is bounded above and below, one 
has 

1^0,1)1 „ = n a-l/2 

^n(a^o,^o) n 

so if a > 1, |Qn(^0) 1)1 — y °° ■ Since d/j,(x) is small for x near 1, one can (and, as 
we will see, does) have (jll.4l) even though (jll.ip fails. □ 

With various counterexamples in place (and more later!), we turn to the positive 
results: 
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Theorem 11.4 (Nevai [67], Nevai-Totik-Zhang ^69] ). If dp, is a measure in 
the classical Nevai class (i.e., for a single interval, e = [6 — 2a, 6 + 2a]), then (|11.5p 
and so (|11.4|) holds uniformly on e. 

Theorem 11.5 (Zhang [108] . Breuer-Last-Simon [S]). Let e be a periodic 
finite gap set and let fj, lie in the Nevai class for z. Then (| 1 1 . 5[) and so .4[) holds 
uniformly on e. 

Theorem 11.6 (Breuer-Last-Simon [14] ). Let e be a general finite gap set and 
let \x lie in the Nevai class for e. Then (|11.5[) and so Q11.4p holds uniformly on 
compact subsets of e lnt . 

Remarks. 1. Nevai [67] proved (|10.4[) / (|10.5[) for the classical Nevai class for 
every energy in e but only uniformly on compacts of e lnt . Uniformity on all of e 
using a beautiful lemma is from [69] . 

2. Zhang [108] proved Theorem 111.51 for any fi whose Jacobi parameters ap- 
proached a fixed periodic Jacobi matrix. Breuer-Last-Simon [14] noted that with- 
out change, Zhang's result holds for the Nevai class. 

3. It is hoped that the final version of [14] will prove the result in Theorem lll.61 
on all of e, maybe even uniformly in e. 

Example 11.7 ( [14] ). In the next section, we will discuss regular measures. 
They have zero Lyapunov exponent on cr ess (/x), so one might expect Nevai's lemma 
could hold — and it will in many regular cases. However, [14] prove that if b n = 
and a n is alternately 1 and \ on successive very long blocks (1 on blocks of size 3 n 

and 5 on blocks of size 2™ ), then d/i is regular for a(dfx) — [—2,2]. But for a.e. 
x e [-2, 2] \ [-1, 1], (ThTD and dTO) fail. □ 

Conjecture 11.8 ([S]). The following is extensively discussed in [14]: For 
general OPRL of compact support and a.e. x with respect to /i, ([10. 4[) and so ([10. 3[) 
holds. 

12. Regularity: An Aside 

There is another class besides the Nevai class that enters in variational problems 
because it allows exponential bounds on trial polynomials. It relies on notions from 
potential theory; see [42], [52], [73], [102] for the general theory and [91], [85] for the 
theory in the context of orthogonal polynomials. 

Definition. Let /i be a measure with compact support and let e = cr css (/x). 
We say fi is regular for e if and only if 

lim ( ai ...a n ) 1/n =C(t) (12.1) 

the capacity of e. 

For e = [—1, 1], C(e) = h and the class of regular measures was singled out 
initially by Erdos-Turan [32 j and extensively studied by Ullman [103] . The general 
theory was developed by Stahl-Totik [91] , 

Recall that any set of positive capacity has an equilibrium measure, p t , and 
Green's function, G c , defined by requiring G c is harmonic on C\ e, G z (z) = log|z| + 
0(1) near infinity, and for quasi-every i£e, 

lim G t {z n ) = (12.2) 
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(quasi-every means except for a set of capacity 0) . e is called regular for the Dirichlct 
problem if and only if (|12.2p holds for every x <E e. Finite gap sets are regular for 
the Dirichlet problem. 

One major reason regularity will concern us is: 

Theorem 12.1. Let e C K be compact and regular for the Dirichlet problem. 
Let fi be a measure regular for e. Then for any e, there is 5 > and C e so that 

sup \p„(z,dfi)\ <C £ e £ l"l (12.3) 

dist(z,e)<<5 

For proofs, see |91] 185] . Since K n has n+ 1 terms, (|12.3p implies 

sup \K n {z,w)\ < (n+ l)C* e 2 e 2£|n| (12.4) 

dist(2,c)<5 
dist(uj,e)<<5 

and for the minimum (since K n ( Zq^Zq) > 1), 

sup \Q n (z,z )\ < (n+l)C 2 e 2£ '"l (12.5) 

dist(js,e)<<5 
dist(zo,e)<(5 

(n) 

The other reason regularity enters has to do with the density of zeros. If Xj 
are the zeros of p n (x,dfi), we define the zero counting measure, dv n , to be the 
probability measure that gives weight to n^ 1 to each Xj. For the following, see 
[9HI85]: 

Theorem 12.2. Let e c K be compact and let \x be a regular measure for e. 
Then 

dv n — > dp,, (12-6) 

i/ie equilibrium measure for e. 

In (|12.6|) . the convergence is weak. 



13. Weak Limits 



A major theme in the remainder of this review is pointwise asymptotics of 
-^-^K n (x, y\ d\x) and its diagonal. Therefore, it is interesting that one can say some- 
thing about -^^K n {x, x; d/i) d[i(x) without pointwise asymptotics. Notice that 



dfi n (x) = 



1 



n + 1 



K n (x, x; dfi) d(j,(x) 



(13.1) 



is a probability measure. Recall the density of zeros, u n , defined after (|12.5|) . 

Theorem 13.1. Let fi have compact support. Let v n be the density of zeros 



id fi n given by (|13.ip . Then for any i — 0, 1, 2, 



du 



n+l 



x l dfi r 







(13.2) 



In particular, dfj, n ^ and dv n ^ +l have the same weak limits for any subsequence 
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PROOF. By Theorem 16.21 the zeros of P n +i are eigenvalues of Tr n M x ir n , so 

x l dv n+l = — |— Tt{{-K n M x ^ n f) (13.3) 
n + 1 

On the other hand, since {pj}^ =0 is a basis for ran(7r n ), 
/ x t dp, n = —_ — - ^T I ' x e \pj(x)\ 2 du(x) 

J 11 3 =0 J 

' Tr{7T n M^ n ) (13.4) 



n + 1 

It is easy to see that (TT n M x Tr n ) — 7r„M,j7r„ is rank at most £, so 

LHS of GO < — — ||AfJ|* 
n + 1 

goes to as n — > oo for t fixed. □ 



Remark. This theorem is due to Simon 88 although the basic fact goes back 
to Avron-Simon [JJ. 

See Simon [88j for an interesting application to comparison theorems for limits 
of density of states. We immediately have: 

Corollary 13.2. Suppose that 

du = w(x) dx + d/j, s (13.5) 

with dfi s Lebesgue singular, and on some open interval I C e = a ess (dfj,) we have 
dv n — * dvoc and 

dvoo 1 1 = ^oo(x) dx (13.6) 
and suppose that uniformly on I, 

lim — K n (x, x) = g(x) (13.7) 
n 

and w(x) ^ on I. Then 

g(x) = —rr 

w(x) 

Proof. The theorem implies dv^ \ I = w{x)g{x). □ 
Thus, in the regular case, we expect that "usually" 

I K n (x,x) (13.8) 
n w(x) 

This is what we explore in much of the rest of this paper. 

14. Variational Principle: Mate Nevai Upper Bounds 

The Cotes numbers, X n (zo), are given by (|9.6|) , so upper bounds on X n (zo) mean 
lower bounds on diagonal CD kernels and there is a confusion of "upper bounds" 
and "lower bounds." We will present here some very general estimates that come 
from the use of trial functions in (|9.6p so they are called Mate-Nevai upper bounds 
(after |65| ). although we will write them as lower bounds on K n . One advantage 
is their great generality. 
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Definition. Let dp be a measure on R of the form 

dp = w(x) dx + dp s (14-1) 

where dp s is singular with respect to Lebesgue measure. We call xq a Lebesgue 
point of p if and only if 



1 1 

x , x Q + - 

n n 



(14.2) 



n 

\w(x) - w (x a )\dx -> (14.3) 



2 

It is a fundamental fact of harmonic analysis ( [76j ) that for any p Lebesgue- 
a.e., xq in R is a Lebesgue point for p. Here is the most general version of the MN 
upper bound: 

Theorem 14.1. Let e C R be an arbitrary compact set which is regular for the 
Dirichlet problem. Let I C e be a closed interval. Let dp be a measure with compact 
support in R with o~ ess (dp) C e. Then for any Lebesgue point, x in I , 

limM -KJx,x)>^l (14.4) 

n^oo n W\x) 

where dp t \ I = p z (x) dx. If w is continuous on I (including at the endpoints as a 
function in a neighborhood of I) and nonvanishing , then (I14.4[) holds uniformly on 
I . If x n — > x G / and A — sup„ n\x n — x\ < oo and x is a Lebesgue, then (|14.4[) 
holds with K n (x, x) replaced by K n (x n , x n ). Ifw is continuous and nonvanishing on 
I , then this extended convergence is uniform in x € / and x n 's with A < Aq < oo . 

Remarks. 1. If I C e is a nontrivial interval, the measure dp e \ I is purely 
absolutely continuous (see, e.g., |85l I89j ). 

2. For OPUC, this is a result of Mate-Nevai [64]. The translation to OPRL 
on [—1,1] is explicit in Mate-Nevai-Totik [66] , The extension to general sets via 
polynomial mapping and approximation (see Sect ion [18]) is due to Totik [96] , These 
papers also require a local Szego condition, but that is only needed for lower bounds 
on A n (see Section Il7[) . They also don't state the x n — > Xoo result, which is a 
refinement introduced by Lubinsky [60) who implemented it in certain [—1, 1] cases. 

3. An alternate approach for Totik's polynomial mapping is to use trial func- 
tions based on Jost-Floquet solutions for periodic problems; see Section Q3J] (and 
also [871189] ). 

One can combine (|14.4p with weak convergence and regularity to get 

Theorem 14.2 (Simon [88 ). Let e c R be an arbitrary compact set, regular 
for the Dirichlet problem. Let dp, be a measure with compact support in R with 
o~ess(dp) = e and with dp regular for e. Let I C e be an interval so w(x) > a.e. 
on I . Then 



(i) 



— K n (x,x) w(x) - p t (x) 
n 



dx^O (14.5) 



(ii) / -K n (x,x)dp s (x)^0 (14.6) 
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Proof. By Theorems HO and [133] 

— K n (x,x) dp — + dp t (14-7) 
n 

Let v\ be a limit point of —K n (x, x) dp s and 

dvi = dp e — dv\ (14.8) 
If / > 0, by Fatou's lemma and (114. 4p . 

fdv2> J p t (x)f{x)dx (14.9) 

that is, dv2 \ I > Pe(x) dx \ I. By l|14.8[) . dv2 \ I < p?{x) dx. It follows dv\ \ I is 
and dv2 \ I = dp e \ I. 

By compactness, ^K n (x,x) dp s \ I —> weakly, implying (|14.6| . By a simple 
argument 88j, weak convergence of -K n (x, x)w(x) dx — > /3 e (a ; ) ^ and (|14.4|) imply 
(fT4~5l) . □ 



15. Criteria for A.C. Spectrum 

Define 

N = i x e 



liminf — K n (x,x) < oo ^ (15.1) 



it 



so that 



n 



lim — K n (x, x) = oo > (15.2) 



Theorem 114.11 implies 



Theorem 15.1. Let e C R fee an arbitrary compact set and dp, = w(x) dx + dp s 
a measure with u(p) — e. Let S ac = {x \ w(x) > 0}. Then N \ E ac has Lebesgue 
measure zero. 

Proof. If xo € R \ S ac and is a Lebesgue point of /i, then w(xo) — and, by 
Theorem HHl i el\JV. Thus, 

(R\S ac )\(R\iV)=iV\S ac 

has Lebesgue measure zero. □ 

Remark. This is a direct but not explicit consequence of the Mate-Nevai ideas 
|64) . Without knowing of this work, Theorem 115.11 was rediscovered with a very 
different proof by Last-Simon |55) . 



On the other hand, following Last-Simon [55) . we note that Fatou's lemma 

and 

— K n (x, x) dp(x) = 1 (15.3) 
implies 

liminf - K n (x,x) dp(x) < 1 (15-4) 



a. 



so 



Theorem 15.2 ( |55| ). E ac \ N has Lebesgue measure zero. 
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Thus, up to sets of measure zero, S ac = N. What is interesting is that this 
holds, for example, when e is a positive measure Cantor set as occurs for the almost 
Mathieu operator (a n = 1, b n = Acos(7rcm + 9), |A| < 2, A ^ 0, a irrational). This 
operator has been heavily studied; see Last [54] . 



16. Variational Principle: Nevai Trial Polynomial 

A basic idea is that if d/ii and d[i2 look alike near xq, there is a good chance 
that K n {x{j, xo; dfj,\) and K n (xq , xq ; d[i2 ) are similar for n large. The expectation 
(|13.8p says they better have the same support (and be regular for that support), 
but this is a reasonable guess. 

It is natural to try trial polynomials minimizing X n (xo, d/ii) in the Christoffel 
variational principle for A n (xo, d/i2), but Example 11.3 shows this will not work in 
general. If d[i\ has a strong zero near some other jci, the trial polynomial for d\i\ 
may be large near X\ and be problematical for d/i2 if it does not have a zero there. 
Nevai [67] had the idea of using a localizing factor to overcome this. 

Suppose eCl, a compact set which, for now, we suppose contains c(d/ii) and 
cr(<i/i2). Pick A = diam(e) and consider (with [•] = integral part) 

\2 



Then for any <5, 



sup N 2[En] (x) < e~< 5 ^ n (16.2) 

\x — x \>8 



so if Q n -2[en] (x) is the minimizcr for /ii and e is regular for the Dirichlet problem 
and /xi is regular for e, then the Nevai trial function 

will be exponentially small away from xq. 

For this to work to compare A„(xo, d^,\) and A(xo, dfj,2), we need two additional 
properties of X„(xq, dfj.%): 

(a) \ n (xo,d/j,i) > C £ e~ £n for each e < 0. This is needed for the exponential 
contributions away from xq not to matter. 

(b) 

,. ,. XnixQ^fix) 

lim limsup — - — r = 1 

eio n^co A„_ 2 [ £ „](a;, dfll) 

so that the change from Q n to Q n - 2 [en] does not matter. 
Notice that both (a) and (b) hold if 

lim nA n (a;o, dfj,) = c > (16.3) 

n — >oo 

If one only has e = a ess (d(i2), one can use explicit zeros in the trial polynomials 
to mask the eigenvalues outside e. 

For details of using Nevai trial functions, see [871 189] . Below we will just refer 
to using Nevai trial functions. 
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17. Variational Principle: Mate Nevai Totik Lower Bound 

In [66], Mate-Nevai-Totik proved: 

Theorem 17.1. Let dfj, be a measure on c© 

w(6) 

dfj, = — — d6 + dfi s 



(17.1) 



which obeys the Szego condition 




(17.2) 



(17.3) 



This remains true if \ni9oo) is replaced by X n (9 n ) with 9 n — > 9^ obeying supn|#„ — 

6oo\ < CO. 



Remarks. 1. The proof in [66] is clever but involved ( [89) has an exposition); 

it would be good to find a simpler proof. 

2. 66] only has the result 9 n — 9^. The general 9 n result is due to Findley 

m 

3. The for which this is proven have to be Lebesgue points for dfj, as well 
as Lebesgue points for log(io) and for its conjugate function. 

4. As usual, if / is an interval with w continuous and nonvanishing, and /i s (-0 = 
0, (IZSD holds uniformly if 9^ G /. 

By combining this lower bound with the Mate-Nevai upper bound, we get the 
result of Mate-Nevai-Totik [66] : 

Theorem 17.2. Under the hypothesis of Theorem \17.1\ for a.e. 9^ G 8B, 



This remains true if \ n {9oc) is replaced by \ n (9 n ) with 9 n 9^ obeying supn\9 n — 
0oo| < co. If I is an interval with w continuous on I and ji a {I) — 0, then these 
results hold uniformly in I. 

Remark. It is possible (see remarks in Section 4.6 of [68]) that (|17.4[) holds if 
a Szego condition is replaced by w(9) > for a.e. 8. Indeed, under that hypothesis, 
Simon |88j proved that 



There have been significant extensions of Theorem 117.21 to OPRL on fairly 
general sets: 

1. [66] used the idea of Nevai trial functions (Section [T6|) to prove the Szego 
condition could be replaced by regularity plus a local Szego condition. 

2. [66j used the Szego mapping to get a result for [—1, 1]. 

3. Using polynomial mappings (see Section [18]) plus approximation, Totik [96 
proved a general result (see below); one can replace polynomial mappings by 
Floquet-Jost solutions (see Section IT^j) in the case of continuous weights on an 
interval (see [87] ). 

Here is Totik's general result (extended from a(d[i) C e to a css (d^) C e): 




lim nA„(6' 00 ) = w(8 oa ) 



(17.4) 
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Theorem 17.3 (Totik [96l US])- Let e be a compact subset ofR. Let I C e be 
an interval. Let d/i have a ess (dfi) = e be regular for t with 



J log(io) dx > — oo 



(17.5) 



Then for a.e. Xao € 7, 



lim - X ra (xoo, Zoo) = Pe f (17-6) 

ri— >oo n W^Xoo) 

The same limit holds for ^K n (x n ,x n ) i/sup„n|a;„ — Xoo\ < oo. If /i s (I) — and 
w is continuous and nonvanishing on I, then those limits are uniform on Xoo G I 
and on all x n 's with sup„ n\x n — x^l < A (uniform for each fixed A). 

Remarks. 1. Totik |98j recently proved asymptotic results for suitable CD 
kernels for OPs which are neither OPUC nor OPRL. 

2. The extension to general compact e without an assumption of regularity for 
the Dirichlet problem is in 



18. Variational Principle: Polynomial Maps 

In passing from [—1,1] to fairly general sets, one uses a three-step process. A 
finite gap set is an e of the form 

t = [ati,Pi]U[a 2 ,l3 st ]U---U[at + i,0t+i] (18-1) 

where 

ai < fa < a 2 < (3 2 < ■ ■ ■ < a e+1 < (3 e+1 (18.2) 

£f will denote the family of finite gap sets. We write e = ei U • • • U e^+i in this 
case with the tj closed disjoint intervals. £ p will denote the set of what we called 
periodic finite gap sets in Section [TD] — ones where each tj has rational harmonic 
measure. Here are the three steps: 

(1) Extend to e S £ p using the methods discussed briefly below. 

(2) Prove that given any e € £/, there is e^™- 1 € £ p , each with the same number 
of bands so tj C tf ] C tf~ l) and n„4 n) = tj. This is a result proven 
independently by Bogatyrev [12) . Peherstorfer |71j . and Totik |97j : see |89j 
for a presentation of Totik's method. 

(3) Note that for any compact e, if e^" 1 - 1 = {x \ dist(x, e) < — }, then is a finite 
gap set and e = n m ^ m \ 

Step (1) is the subtle step in extending theorems: Given the Bogatyrev- 
Peherstorfer-Totik theorem, the extensions are simple approximation. 

The key to e € £ p is that there is a polynomial A: C — > C, so A _1 ([— 1, 1]) = e 
and so that tj is a finite union of intervals with disjoint interiors so that A is a 
bijection from each e& to [—1,1]. That this could be useful was noted initially by 
Geronimo-Van Assche [36 . Totik showed how to prove Theorem 117.31 for t E £ p 
from the results for [—1, 1] using this polynomial mapping. 

For spectral theorists, the polynomial A = ^A where A is the discriminant 
for the associated periodic problem (see [431 1531 11041 1951 [89]). There is a direct 
construction of A by Aptekarev [4] and Peherstorfer [TOl ITO IT2"] . 
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19. Floquet Jost Solutions for Periodic Jacobi Matrices 

As we saw in Section [TBI models with appropriate behavior are useful input for 
comparison theorems. Periodic Jacobi matrices have OPs for which one can study 
the CD kernel and its asymptotics. The two main results concern diagonal and just 
off-diagonal behavior: 

Theorem 19.1. Let /i be the spectral measure associated to a periodic Jacobi 
matrix with essential spectrum, e, a finite gap set. Let d/i — w(x) dx on e {there 
can also be up to one eigenvalue in each gap). Then uniformly for x in compact 
subsets of e mt , 

i Kn (x,x)^^l ( 19 -!) 

and uniformly for such x and a, b in K with \a\ < A, \b\ < B, 

K n (x + ±x + ±) sm(n Pt (x)(b-a)) 
K n {x,x) np e (x)(b — a) 

Remarks. 1. (| 19 . 2[) is often called bulk universality. On bounded intervals, it 
goes back to random matrix theory. The best results using Riemann-Hilbert meth- 
ods for OPs is due to Kuijlaars-Vanlessen [51] . A different behavior is expected at 
the edge of the spectrum — we will not discuss this in detail, but see Lubinsky 62j. 

2. For [—1,1], Lubinsky 60J used Legendre polynomials as his model. The 
references for the proofs here are Simon [871 189|. 

The key to the proof of Theorem 1 19. II is to use Floquet- Jost solutions, that is, 
solutions of 

a n u n+ i + b n u n + a„_iw n _i = xu n (19.3) 
for n € Z where {a n , b n } are extended periodically to all of Z. These solutions obey 

u n+P = e i9 ^u n (19.4) 

For x € e lnt , u n and u n are linearly independent, and so one can write p.-± in terms 
of u. and u.. Using 

1 d6 
Ptix) = — — 
pit dx 

one can prove (|19.1[) and (|19.2j) . The details are in [5TI 

20. Lubinsky's Inequality and Bulk Universality 

Lubinsky |60j found a powerful tool for going from diagonal control of the CD 
kernel to slightly off-diagonal control — a simple inequality. 

Theorem 20.1. Let n < pf and let K ni K* be their CD kernels. Then for any 

\K n (z,() - K(z,()\ 2 < K n (z,z)lK n ((,() ~ K((,()} (20.1) 
Remark. Recall (Theorem [Of that K n ((,() > K*(C,0- 
Proof. Since K n — K* is a polynomial z of degree n: 

K n {z,Q-K*{z,Q - / K n (z,w)[K n (w,Q - K*(w,()}dfi(w) (20.2) 



(19.5) 
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By the reproducing kernel formula (|1.19p , we get (|20.1|) from the Schwarz inequality 
if we show 

J \K n (w,0-K(™,0\ 2 dri™)<Kn((,0-K((,0 (20.3) 

Expanding the square, the K\ term is K n (£, £) by (| 1 . 1 9(1 and the K n K^ cross 
term is — 2K^((, £) by the reproducing property of K n for d/j, integrals. Thus, (|20.3|) 
is equivalent to 

V:(^,oi 2 ^M<^(c,o (20.4) 



This in turn follows from \x < fi* and (| 1 . 19(1 for p*\ □ 

This result lets one go from diagonal control on measures to off-diagonal. Given 
any pair of measures, and v, there is a unique measure fi V v which is their least 
upper bound (see, e.g., Doob |30j). It is known (see [85] ) that if fi, v are regular 
for the same set, so is /i V v. (|20. 1|) immediately implies that (go from /i to fi* and 
then /J* to v): 

Corollary 20.2. Let /x, v be two measures and fi* = /J, V v. Suppose for some 
z„ — > Zoo, w-n — * Zoo, we have for n = /i, v, //* that 

K n (z n ,z n ;r)) _ K n (w n ,w n ;r)) _ 1 



and that 



Then 



woo K n (Zoo,Zoo;rj) n ^°° K n (zoo,Zoo;v) 

i. K n {Zoo , Zoo I M) ^"n(^ooi ^oo; ^) 

n^oo K n (Zoo,Zoo;H*) ™^°° ^n^oc^ooSM*) 



K n (z n ,w n ;^) 
n-^oo K n (z n ,w n ;v) 

Remark. It is for use with x n = Xoo + % or + that we added x n — > Xoo 
to the various diagonal kernel results. This "wiggle" in Xoo was introduced by 
Lubinsky [60] . so we dub it the "Lubinsky wiggle." 

Given Totik's theorem (Theorem I17.3|) and bulk universality for suitable mod- 
els, one thus gets: 

Theorem 20.3. Under the hypotheses of Theorem \17.3\ for a.e. Xoo in I, we 
have uniformly for \a\, \b\ < A, 

j. K n (xoo + f^Xpo + |) _ sin(7T/3 e (x 00 )(b- a)) 

n->oo Kn^XocXoo) Kp ( (Xoo){b - a) 

Remarks. 1. For e = [— 1, 1], the result and method are from Lubinsky |60j. 
2. For continuous weights, this is in Simon [87j and Totik [99] . and for general 
weights, in Totik [99] . 

21. Derivatives of CD Kernels 

The ideas in this section come from a paper in preparation with Avila and 
Last [6] . Variation of parameters is a standard technique in ODE theory and used 
as an especially powerful tool in spectral theory by Gilbert-Pearson |38j and in 
Jacobi matrix spectral theory by Khan-Pearson |49j . It was then developed by 
Jitomirskaya-Last |44l 1451 146] and Killip-Kiselev-Last |50j , from which we take 
Proposition 121.11 
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Proposition 21.1. For any x,xq, we have 

n-l 

Pn{x)-p n {x ) = (x-xq) (p w (xp ) q m (x ) - p m (x )q n (x ))p m (x) (21.1) 
In particular, 

n-l 

Pn( x o) = ^2 (Pn{xo)q m (x ) - p m (x )q n (xo))p m (xo) (21.2) 

m=0 

Here q n are the second kind polynomials denned in Section [H For (|21.1| , see 
[HI HI 1151 [50]. This immediately implies: 

Corollary 21.2 (Avila-Last-Simon [6j). 



d 1 

da n 



K n [ x Q 



a a 
- , x + - 
n n 



3=0 



= — ^2 Pj( x ") 2 [ ^2Pk(xo)qk{xo)j - q J (xo)Pj(x )( ^Pfe(xo) 



k=0 



k=0 



(21.3) 



This formula gives an indication of why (as we see in the next section is im- 
portant) lim ^K n (xp + z-, xo + -) has a chance to be independent of a if one notes 
the following fact: 

Lemma 21.3. If {a n }'^' =1 and {/3 n }n° = i o>re sequences so that lim X^=i a « = 
A and lim Y,n=i Pn = B exist and sxvp N [jj ^n=il a ™l + \Pn\] < oo, th 



en 



wElfath) ~ fat *) 

7 = 1 L v k=l ' V k=\ ' 



(21.4) 



This is because 



N j 



N 2 

3=1 k=l 

Setting otj = pj(xp) 2 and (3j = pj{xo)qj{xp), one can hope to use (|21.4[) to prove 
the right side of (|21.3[) goes to zero. 

22. Lubinsky's Second Approach 



Lubinsky revolutionized the study of universality in [60] , introducing the ap- 
proach we described in Section [201 While Totik |99| and Simon [87j used those 
ideas to extend beyond the case of e = [— 1, 1] treated in [60) . Lubinsky developed 
a totally different approach [61] to go beyond [60] , That approach, as abstracted 
in Avila-Last-Simon [6J, is discussed in this section. Here is an abstract theorem: 

Theorem 22.1. Let dfj, be a measure of compact support on M. Let xq be a 
Lebesgue point for jj, and suppose that 
(i) For any e, there is a C e so that for any R, we have an N(e, R) so that for 
n > N(e,R), 

- K n (x + -,x + - ) < C £ e £|z|2 (22.1) 
n \ n n ) 

for all z G C with \z\ < R. 
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(ii) Uniformly for real a's in compact subsets of I 



K n (x + - ,x + -) 
lim ny " ' u " ; = i (22.2) 

n-»oo K n (Xo,Xo) 



Let 

w(x ) 

Pn = 

n 

Then uniformly for z, w in compact subsets of C, 



K n {x ,x a ) (22.3) 



K n (xo + T ^,xo + ^ : ) S m(n(z-w)) 
lim — '— = — — (22.4) 

n— oo K n (x ,x ) n(z-w) 

Remarks. 1. If p n — > p e (xo), the density of the equilibrium measure, then 
(122. 4j) is the same as (119.21) . In every case where Theorem 122.11 has been proven to 
be applicable (see below), p n — * p t (xo). But one of the interesting aspects of this is 
that it might apply in cases where p n does not have a limit. For an example with 
a.c. spectrum but where the density of zeros has multiple limits, see Example 5.8 
of [85]. 

2. Lubinsky [61] worked in a situation (namely, xo in an interval / with w(xq) > 
c > on I) where (|22.1[) holds in the stronger form Ce D l 2 ' (no square on |z|) and 
used arguments that rely on this. Avila-Last-Simon [Bj found the result stated 
here; the methods seem incapable of working with (|22.ip for a fixed e rather than 
all e (see Remark 1 after Theorem 122. 2p . 

Let us sketch the main ideas in the proof of Theorem 122.11 

(1) By (TCTi), 

liminf - K n (x , x ) > (22.5) 
n 

(2) By the Schwarz inequality (|1.14p . (|22.ip . and (|22.5p . and by the compact- 
ness of normal families, we can find subsequences n(j) so 



F(z,w) (22.6) 



and F is analytic in w and anti- analytic in z. 

(3) Note that by (|22~2|) and the Schwarz inequality (fl~T4|) . we have for a, b g K, 

F(a,a) = l |F(o,6)|<l (22.7) 

By compactness, if we show any such limiting F is sin(7r(^ — w))/{z — w), we have 
(|22.4p . By analyticity, it suffices to prove this for z = a real, and we will give details 
when z = 0, that is, we consider 

/(0) = 1 \f(x)\ < 1 for x real 

K n (x , x + a)\ 2 w(a) da < K n (x 0l x ) (22.9) 



([2"2~T]) becomes 
(4) By (Trill), 
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which, by using the fact that xq is Lebesgue point, can be used to show 

\f(x)\ 2 dx<l (22.10) 



(5) By properties of K n (see Section |6]) and Hurwitz's theorem, / has zeros 
{ x j}jL-oo j^o om y on ^> which we label by 

• ■ • < x-i < < xi < x 2 < ■ ■ ■ (22.11) 

and define xq — 0. By Theorem 17.21 using (|22.2|) . we have for any j, k that 

\xj-x k \>\j-k\-l (22.12) 

(6) Given these facts, the theorem is reduced to 
Theorem 22.2. Let f be an entire function obeying 

(1) 

/(0) = 1 |/(af)| < 1 for x real (22.13) 

(2) 

\f{x)\ 2 dx<\ (22.14) 



(3) / is real on R, has only real zeros, and if they are labelled by (|22.11|) . then 
(|22.12j) holds. 

(4) For any e, there is a C' e so 

\f(z)\ < C £ e^ 2 (22.15) 

Then 

m = ^ (22.16) 

Remarks. 1. There exist examples (\6) e~ az2+bz simrz/nz that obey (l)-(3) 
and (|22. 15[) for some but not all e. 

2. We sketch the proof of this in case one has 

\f{z)\<CeP\'\ (22.17) 
instead of (|22.15[k see [6] for the general case. 

Lemma 22.3. If (l)-(3) hold and (|22. 17|) holds, then for any e > 0, there is 
D £ with 

\f(z)\ < D ee (T+e)Un»*| (22.18) 
Sketch. By the Hadamard product formula [2], 

where D is real since / is real on R. Thus, for y real, 

y 2 



By |m, \ Xj \>j-l,BQ 



j*0 " T 3 



3=1 



2 
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which, given Euler's formula for sinirz/z, implies (122. 18j) for z = iy. By a 
Phragmen-Lindelof argument, (|22.18|) for z real and for z pure imaginary and 
(|22.17p implies (|22.18p for all z. 

Thus, Theorem [22721 ( under hypothesis l|22.17|) ) is implied by: 

Lemma 22.4. /// is an entire function that obeys (|22. 13[) . (|22.14p . and (|22.18p . 
then (|22.16|) holds. 

Proof. Let / be the Fourier transform of /, that is, 

f(k) = (27T)- 1 / 2 / e- lkx f{x)dx (22.19) 



(in L 2 limit sense). By the Paley- Wiener Theorem [74], (|22.18|) implies / is sup- 
ported on [-7r,7r]. By (|22.14|) . 

||/|| L 2 = \\(2ti)- 1 I 2 X[ ^ m \\ l , = 1 

and, by (|22.13|) and support property of /, 

</,(2vr)- 1 / 2 Xh _ ] ) = l 

We thus have equality in the Schwarz inequality, so 

/ = (27t)-^ 2 XI -, M 

which implies (122. 16j) . □ 

This theorem has been applied in two ways: 

(a) Lubinsky |61j noted that one can recover Theorem l20.3l from iust Totik's result 
Theorem 117.31 without using the Lubinsky wiggle or Lubinsky 's inequality. 

(b) Avila-Last-Simon [6] have used this result to prove universality for ergodic 
Jacobi matrices with a.c. spectrum where e can be a positive measure Cantor 
set. 

23. Zeros: The Freud Levin Lubinsky Argument 



In the final section of his book [34] , Freud proved bulk universality under fairly 
strong hypotheses on measures on [—1, 1] and noticed that it implied a strong result 
on local equal spacings of zeros. Without knowing of Freud's work, Simon, in a 
series of papers (one joint with Last) |82|, I83|, I84L 157) . focused on this behavior, 
called it clock spacing, and proved it in a variety of situations (not using universality 
or the CD kernel). After Lubinsky 's work on universality, Levin [59] rediscovered 
Freud's argument and Levin-Lubinsky [59] used this to obtain clock behavior in a 
very general context. Here is an abstract version of their result: 

Theorem 23.1. Let \x be a measure of compact support on R; let xq S c(/z) be 
such that for each A, for some c n , 

K n ^(x + ^-,x + ^-) ^ sm(7r(6-a)) 
K n _ 1 (x ,x ) 7r(6-a) 

uniformly for real a,b with \a\, \b\ < A. Let x^ n \xo) denote the zeros of p n (x; dfj.) 
labelled so 

■ ■ ■< x^l(x ) <x < x ( n) (x a ) < x[ n \x ) < ■ ■■ (23.2) 

Then 
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(1) 

limsup nc n {x^ — xq) < 1 (23.3) 

(ii) For any J, for large n, there are zeros x^ for all j G {— J, — J+l, . . . , J— 1, J}. 
(iii) 

lira — %^)nc n = 1 for each j (23.4) 

Remarks. 1. The meaning of x^ has changed slightly from Section [51 

2. Only nc n enters, so the "n" could be suppressed; we include it because 
one expects c„ as defined to be bounded above and below. Indeed, in all known 
cases, c„ — * p(xo), the derivative of the density of states. But see Remark 1 after 
Theorem 122.11 for cases where c„ might not have a limit. 

3. See iSS for the OPUC case. 



PROOF. Let x^Ixq) be the zeros of p n { x )Pn-i{ x o) — Pn( x o)Pn-i( x ) labelled 

as in (|23.2p (with Xq™^(xo) = xq). By (|23.1[) . we have x±l(xo)nc n — + 1 since 
sin(7ra)/a is nonvanishing on (—1,1) and vanishes at ±1. The same argument 
shows K n (x^i,x±-l + b/nc n ) is nonvanishing for \b\ < i, and so there is at most 
one zero near x±i on l/nc n scale. It follows by repeating this argument that 

nc„^ n) — > j (23.5) 

for all j . 

Since we have (see Section [6]) that 

^ (") i \ ^ -(") i \ 
x < x x '(xq) < x\ '(xo) 

by interlacing, which implies (i) and similar interlacing gives (ii). Finally, (|23.4D 
follows from the same argument that led to (|23.5p . □ 

24. Adding Point Masses 

We end with a final result involving CD kernels — a formula of Geronimus [371 
formula (3.30)]. While he states it only for OPUC, his proof works for any measure 
on C with finite moments. Let /i be such a measure, let zq € C, and let 

v = n + XS t0 (24.1) 

for A real and bigger than or equal to — /j,({zq}). 

Since X n (z; dv) and X„(z; dfi) are both monic, their difference is a polynomial 
of degree n — 1, so 

n-l 

X n (x;dv) — X n (z; dfi) + CjXj(z; dfi) (24.2) 

3=0 

where 



j= Xj(z;dfi)[X n (z;di/) - X n (z;dfi)]dfi (24.3) 



Xj(z; d/i) X n (z; dv)[dv — X5 Z0 ] (24.4) 



= -\Xj(zo;dfi)X n (z ;dL>) (24.5) 



3(5 
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where (|24.4[) follows from Xj(-,dfi) _L X n (- ,d(j,) in L 2 (dfi) and (|24.5|) from 
Xj( • , d/i) _L X n ( ■ , di>) in L (dv). Thus, 

X n (z;dv) — X n (z; dfi) - XX n (z Q ; dv)K n _i(z , z; dfi) (24.6) 

Set z — zq and solve for X n (zo; dv) to get: 

Theorem 24.1 (Geronimus [37]). Let \i,v be related by (|24.1|) . T/ien 

X„(z; di/) = X n (z; dpi) t -rpr — (24.7) 

1 + XK n -i(z , z ] dfi) 

This formula was rediscovered by Nevai [67j for OPRL, by Cachafeiro- 
Marcellan [HI El], Simon [UJ (in a weak form), and Wong [l05l [TUB] for OPUC. 
For general measures on C, the formula is from Cachafeiro-Marcellan [171 118] , In 
particular, in the context of OPUC, Wong [106] noted that one can use the CD 
formula to obtain: 

Theorem 24.2 (Wong [T05l[T06] ). Let dfi be a probability measure on <9B and 
let dv be given by 

dv = d Jt±^L (2 4. 8 ) 
1 + A y ' 

for zq € <9B and A > — fi({z Q }). Then 

(1-K(d^)| 2 ) 1/2 



a n {dv) = a n (dfi) + tp n+1 (z ) tp* n {z ) (24.9) 

A + K n {zo, z ; dfi) 

Proof. Let 

Qn = A -1 + K n (zo, z: dpt) (24.10) 

Since 

$ n+ i(z;d!>) = $„+i(z;d^) (24.11) 

and 

a„(di>) = -$„+i (0;dP) (24.12) 

([24~7]) becomes 



a n (dv) - a n (dfi) = Q n x $ n+ i(z a ) K n (z o ,0] dfi) (24.13) 
By the CD formula in the form (13.23p . 



/f„(zo,0) = ^(z o )^(0) (24.14) 

" W 

since $*(0) = 1 and ||$*|| = ||$„||. (|2T9"]I then follows from ||$ n+1 ||/||$„|| = 
(1-K| 2 ) 1/2 . □ 

To see a typical application: 

Corollary 24.3. Let zq be an isolated pure point of a measure dfi on <9D. Let 
dv be given by (j24.8|> where A > — (i({zq}) (so zq is also a pure point of dv). Then 
for some D,C > 0, 

\a n (dv) - a n (dfi)\ < De~ Cn (24.16) 
Proof. By Theorem 10.14.2 of [UJ, 

\p n (z ;dfi)\ <D ie ~i Cn (24.17) 
This plus (|24~9f implies |24~T6]) . □ 
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This is not only true for OPUC but also for OPRL: 

Corollary 24.4. Let zq be an isolated pure point of a measure of compact 
support dfi on R. Let dv be given by (|24. 8[) where A > — /i({zo}) so z is also a pure 
point of dv. Then for some D,C > 0, 



(i) 



U^)_ (1 + A)1/2 



< De- Cn (24.18) 



K n (d(j,) 

(ii) \\Pn{->dv)-{l + \) 1 / 2 p n {-,d l i)\\ L z {dv) <De- Cn (24.19) 

(iii) \a n {dv) - a n (dfi)\ < De~ Cn (24.20) 
\b n (dv) - b n {dn)\ < De~ Cn (24.21) 

Sketch. Isolated points in the spectrum of Jacobi matrices obey 

\pn(z )\ < D ie ~ Cin (24.22) 

for suitable Ci,D x (see [TII25]). 

(|24.7|) can be rewritten for OPRL 

K n (d/j,)P n (x;du) = p n (x;dn) - Xp n (x ; dp:) - Kn ~ 1 ^ x ^ x ^ d l i ) (24.23) 

1 + XK n (x , x ; dp,) 

Since 

WKn^^xo, x; dfi)\\ 2 L 2 idll ) = K n - 1 (x ,x ;dfi) 

and 

[\K n -i(x ,x ;dii)\ 2 dS Xo = K n -i(x , x ; d^) 2 

and K n _x(xo, Xq) is bounded (by (|24.22p ). we see that \\K n _i(xo; • ; ^A t )||i 2 (diy) is 
bounded. Thus, by (|24~22|) and (|24~23f . 

Knid^Kjdv)- 1 = (1 + \)- 1/2 + 0{e~ Cin ) 

which leads to (|24.18p . 

This in turn leads to (ii), and that to (iii) via (|24.22[) . and, for example, 

a n (dp) = / xp n (x; dp)p n -i(x; dp) dp, (24.24) 



a n (dv) = J xp n (x; dv)p n ^i(x; dv) dv (24.25) 

□ 

This shows what happens if the weight of an isolated eigenvalue changes. What 
happens if an isolated eigenvalue is totally removed is much more subtle — sometimes 
it is exponentially small, sometimes not. This is studied by Wong [107] . 
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